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CODE CONVERSION METHOD AND APPARATUS 

TECHNICAL FIELD: 
5 The present invention relates to an encoding and decoding method 

for transmitting or storing a speech signal at low bit rates, and more 
particularly, to a code conversion method and apparatus for converting, in a 
high sound quality and with a small amount of calculations, codes generated 
by encoding a speech in accordance with a certain scheme to codes which 
10 can be decoded in accordance with another scheme. 
BACKGROUND ART: 

As a method of efficiently encoding speech signals at middle bit rates 
or low bit rates, one widely used method separates a speech signal into an 
LP (Linear Prediction) filter and an excitation signal for driving it and then 

1 5 encodes the speech signal. One representative method is CELP (Code 
Excited Linear Prediction). CELP drives an LP filter, which has set therein 
LP coefficients representative of frequency characteristics of an input speech, 
with an excitation signal represented by the sum of an adaptive codebook 
(ACB) representative of the pitch period of the input speech and a fixed 

20 codebook (FCB) made up of a random number and a pulse to generate a 
synthetic speech signal. In this event, an ACB component and an FCB 
component are multiplied by gains (ACB gain and FCB gain), respectively. 
For CELP, see, for example, M. Schroeder, "Code excited linear prediction: 
High quality speech at very low bit rates," Proc. of IEEE Int. Conf. on Acoust., 

25 Speech and Signal Processing, pp.937-940, 1985. 

Assuming, for example, an interconnection between a 3G (Third 



Generation) mobile network and a wired packet network, a problem arises in 
that these networks cannot be directly connected because the respective 
networks employ different standard speech encoding scheme. As a 
solution to this, a tandem connection can be contemplated. 

5 FIG. 1 illustrates an example of a conventional code conversion 

apparatus based on the tandem connection, where codes generated by 
encoding a speech using a first speech coding scheme are converted into 
codes which can be decoded in accordance with a second speech coding 
scheme. The second speech coding scheme is generally different from the 

10 first speech coding scheme. In the following, for simplicity of description, 
the first speech coding scheme is simply called "Scheme 1," and codes 
generated by encoding a speech using the first speech coding scheme is 
called "first code string data." Likewise, the second speech coding scheme 
is simply called "Scheme 2," and codes generated by encoding a speech 

15 using the second speech coding scheme is called "second code string data." 
Assume that code string data is communicated at a frame period (for 
example, a period of 20 milliseconds) which is the processing unit of speech 
encoding/decoding. For a speech encoding method and decoding method, 
see the aforementioned Schroeder-s article, or 3GPP standard: "AMR 

20 Speech codec: Transcoding functions" (3GPP TS 26.090). 

Referring to FIG. 1 , the following description will be given of a 
conventional code conversion apparatus based on the tandem connection. 

In the code conversion apparatus, input terminal 10, speech decoding 
circuit 1050, speech encoding circuit 1060, and output terminal 20 are 

25 connected in series in this order. Speech decoding circuit 1 050 decodes a 
speech from first code string data applied thereto through input terminal 10 
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by a decoding method conforming to Scheme 1 , and supplies the decoded 
speech to speech encoding circuit 1 060 as a first decoded speech. Speech 
encoding circuit 1060 receives the first decoded speech delivered from 
speech decoding circuit 1050, and delivers code string data, generated by 
5 encoding the first decoded speech by a second speech coding method, 
through output terminal 20 as second code string data. 

However, the foregoing conventional code conversion apparatus 
based on the tandem connection re-encodes a decoded speech signal, 
generated by once decoding applied first code string data by the speech 
10 decoding circuit of Scheme 1 , as it is by the speech encoding circuit of 
Scheme 2 even though its signal characteristics are not suitable for re- 
encoding due to a deterioration resulting from the coding, and therefore has 
a challenge that the speech quality deteriorates in a finally decoded speech if 
the second code string data generated by these code conversions is 
15 decoded in accordance with Scheme 2. 

DISCLOSURE OF THE INVENTION: 
It is an object of the present invention to provide a code conversion 
method for decoding and re-encoding an encoded speech, which is capable 
of reducing a deterioration in speech quality of a finally generated speech 
20 signal. 

It is another object of the present invention to provide a code 
conversion apparatus for decoding and re-encoding an encoded speech, 
which is capable of reducing a deterioration in speech quality of a finally 
generated speech signal. 
25 The first object of the present invention is achieved by a code 

conversion method for converting first code string data conforming to a first 
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speech coding scheme into second code string data conforming to a second 
speech coding scheme. The method has the steps of decoding the first 
code string data to generate a first decoded speech, correcting the signal 
characteristics of the first decoded speech to generate a second decoded 
5 speech, and encoding the second decoded speech in accordance with the 
second speech coding scheme to generate the second code string data. 

In the code conversion method of the present invention, in the step of 
generating the second decoded speech, the signal characteristics are 
preferably corrected by a filter having characteristics which vary in 
10 accordance with the characteristics of the first decoded speech. Also, in the 
step of generating the second decoded speech, the signal characteristics of 
the first decoded speech are preferably corrected into signal characteristics 
suitable for re-encoding. 

The second object of the present invention is achieved by a code 
1 5 conversion apparatus for converting first code string data conforming to a 
first speech coding scheme into second code string data conforming to a 
second speech coding scheme. The code conversion apparatus has a 
speech decoding circuit for decoding the first code string data to generate a 
first decoded speech, a signal characteristic correcting circuit for correcting 
20 signal characteristics of the first decoded speech to generate a second 
decoded speech, and a speech encoding circuit for encoding the second 
decoded speech in accordance with the second speech coding scheme to 
generate the second code string data. 

In the code conversion apparatus of the present invention, the signal 
25 correcting circuit preferably corrects the signal characteristics of the first 
decoded speech into signal characteristics suitable for re-encoding to 
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generate the second decoded speech. Also, the signal characteristic 
correcting circuit preferably corrects the signal characteristics of the first 
decoded speech using a filter having characteristics which vary in 
accordance with the characteristics of the first decoded speech to generate 
5 the second decoded speech. 

In the present invention, the filter used for correcting the signal 
characteristics of the first decoded speech is preferably an inverse filter to a 
post filter, an emphasis filter having characteristics for emphasizing high- 
band components of frequency, or a filter which is a combination of the two. 
10 Also, the filter characteristics are preferably varied using at least one of 
frame type information included in the first code string data, the size of the 
first code string data, and a characteristic amount which can be calculated 
from the first decoded speech. 

A decoded speech signal generated by decoding by a speech 
15 decoding circuit of Scheme 1 generally has signal characteristics which are 
not suitable for re-encoding due to a deterioration resulting from the coding. 
When the decoded speech signal is re-encoded as it is by a speech 
encoding circuit of Scheme 2, a degradation in sound quality is prominent in 
a speech signal decoded from second code string data after the code 
20 conversion. In the present invention, the first code string data is decoded 
from the first code string data by the speech decoding circuit of Scheme 1 to 
generate a decoded speech signal, the signal characteristics of which are 
corrected, and subsequently, the corrected decoded speech signal is re- 
encoded by the speech encoding circuit of Scheme 2. As a result, 
25 according to the present invention, the deterioration in sound quality is 
reduced in a speech signal decoded from the second code string data. 
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BRIEF DESCRIPTION OF THE DRAWINGS: 
FIG. 1 is a block diagram illustrating the configuration of a 

conventional code conversion apparatus based on a tandem connection; 
FIG. 2 is a flow chart showing a processing procedure of a code 

conversion based on the present invention; 

FIG. 3 is a block diagram illustrating the configuration of a code 

conversion apparatus according to a first embodiment of the present 

invention; 

FIG. 4 is a block diagram illustrating the configuration of a code 
conversion apparatus according to a second embodiment of the present 
invention; and 

FIG. 5 is a block diagram illustrating another exemplary configuration 
of a code conversion apparatus based on the present invention. 

BEST MODE FOR CARRYING OUT THE INVENTION: 

FIG. 2 shows the flow of processing based on a code conversion 
method of the present invention. The code conversion method based on 
the present invention has the following steps (a) to (c): 

(a) : generating a first decoded speech from first code string data by a 
decoding method of Scheme 1 (step S101); 

(b) : correcting the first decoded speech to have signal characteristics 
suitable for re-encoding using a filter to generate a second decoded speech 

(steps S102, 103); and 

(c) encoding the second decoded speech by a second encoding 
method to generate second code string data (step S104). 

Thus, in the present invention, a decoded speech signal generated by 
decoding the first code string data by the speech decoding circuit of Scheme 



1 is corrected using a filter to have signal characteristics suitable for re- 
encoding, and the corrected decoded speech signal is re-encoded by the 
speech encoding circuit of Scheme 2. It is therefore possible to reduce a 
speech quality deterioration in the speech signal decoded from the second 
5 code string data after the code conversion, caused by re-encoding the 

decoded speech having signal characteristics unsuitable for re-encoding due 
to a deterioration due to the encoding, as it is, by the speech encoding circuit 
of Scheme 2. 

Next, description will be given of a code conversion apparatus based 
10 on the present invention. In FIG. 3 which illustrates a code conversion 

apparatus according to a first embodiment of the present invention, elements 
identical or similar to those in FIG. 1 are designated the same reference 
numerals. 

The code conversion apparatus illustrated in FIG. 3 comprises input 
15 terminal 10; speech decoding circuit 1050 which is supplied with first code 
string data from input terminal 10; signal characteristic correcting circuit 2070 
which is supplied with the output of speech decoding circuit 1050; speech 
encoding circuit 1060 which is supplied with the output of signal 
characteristic correcting circuit 2070; and output terminal 20 for delivering 
20 second code string data generated from speech encoding circuit 1060 to the 
outside. Speech decoding circuit 1 050 generates a first decoded speech 
from the first code string data by a decoding method of Scheme 1 . Signal 
characteristic correcting circuit 207 corrects the first decoded speech to have 
signal characteristics suitable for re-encoding using a filter to generate a 
25 second decoded speech. Speech encoding circuit 1060 encodes the 

second decoded speech by a second encoding method to generate second 
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code string data. Input terminal 10, output terminal 20, speech decoding 
circuit 1050, and speech encoding circuit 1060 are the same as those 
illustrated in FIG. 1 . 

In the following, a detailed description will be given of signal 
5 characteristic correcting circuit 2070 which is a difference in configuration 
between the code conversion apparatus illustrated in FIG. 3 and the 
conventional code conversion apparatus illustrated in FIG. 1 . 

Signal characteristic correcting circuit 2070 receives the first decoded 
speech delivered from speech decoding circuit 1050, and applies speech 
10 encoding circuit 1060 with a signal generated by driving a filter represented 
by transfer function F(z) with the first decoded speech, as a second decoded 
speech. Here, filter F(z) has such signal characteristics that correct the first 
decoded speech to have signal characteristics suitable for re-encoding. 

In many cases, a post filter is employed in a speech decoding circuit 
15 for improving a subjective sound quality, but the sound quality deteriorates if 
a post-filtered decoded speech is re-encoded. Thus, the sound quality can 
be improved by applying the decoded speech to a filter inverse to the post 
filter. Filter F(z) can be expressed by Equation (1 ) when the transfer 
function of the post filter is P(z): 
20 F(z) = F1(z) = 1/P(z) (1) 

Here, for details on the post filter, see, for example, a description in 
3GPP TS 26.090, Section 6.2. 

Also, in the aforementioned deterioration in sound quality, muffled 
feeling of sound often constitutes a significant factor. As such, filter F(z) 
25 may be a filter which has such frequency characteristics that emphasize 
high-band components of frequency. In this event, F(z) can be expressed, 
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for example, by Equation (2): 

F(z) = F2(z) = 1-u(1/z) (2) 
where u is a coefficient (for example, 0.2) which represents the degree of 
emphasis for high-band components. 
5 Further, the aforementioned F1(z) and F2(z) may be combined. In 

this event, F(z) can be expressed by Equation (3): 

F(z) = F3(z) = F7(z) F3(z) = (1 -u(1 /z))/F(z) (3) 
As is apparent from the foregoing, this embodiment is advantageous 
in that a speech decoding circuit and a speech encoding circuit, conforming 
10 to a standard scheme, can be utilized as they are because there is no need 
for adapting a speech decoding circuit and a speech encoding circuit which 
form part of a conventional code conversion circuit. 

Next, a description will be given of a code conversion apparatus 
according to a second embodiment of the present invention. In this second 
1 5 embodiment, the filter characteristics of the signal characteristic correcting 
circuit in the code conversion apparatus of the aforementioned embodiment 
are made variable in accordance with the characteristics of a speech signal. 
In FIG. 4 which illustrates the code conversion apparatus of the second 
embodiment, elements identical or similar to those in the third embodiment 
20 are designated the same reference numerals. 

As illustrated in FIG. 4, in the code conversion apparatus of the 
second embodiment, speech decoding circuit 1050 shown in FIG. 3 can be 
regarded as being composed of code separation circuit 3010 and speech 
decoding circuit 3050. Likewise, speech encoding circuit 1 060 shown in 
25 FIG. 3 is regarded as being composed of code multiplexing circuit 3020 and 
speech encoding circuit 3060. 
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Code separation circuit 3010 separates a header and a payload from 
first code string data applied thereto through input terminal 1 0. The header 
includes frame type information. By referencing the frame type information, 
it is possible to distinguish whether a signal decoded from the code string 

5 data corresponds to a speech section or a silent section. Here, for details 
on the frame type information, see, for example, 3GPP standard: "AMR 
Speech codec frame structure" (3GPP TS 26.1 01 ). The payload contains 
codes corresponding to speech parameters. The speech parameters in 
code string data include, for example, an LP coefficient, ACB, FCB, ACB, 

10 and gains (ABC gain and FCB gain). Codes corresponding to the LP 
coefficient, ACB, FCB, and gains are designated by a first LP coefficient 
code, a first ACB code, a first FCB code, and a first gain code, respectively. 
Code separation circuit 3010 delivers the frame type information to signal 
characteristic correcting circuit 3070, and delivers the first LP coefficient 

15 code, first ACB code, first FCB code, and first gain code to speech decoding 
circuit 3050. 

Speech decoding circuit 3050 receives the first LP coefficient code, 
first ACB code, first FCB code, and first gain code delivered from code 
separation circuit 3010, decodes a speech from these codes by a decoding 

20 method of Scheme 1 , and delivers the decoded speech to signal 
characteristic correcting circuit 3070 as a first decoded speech. 

Speech encoding circuit 3060 receives the second decoded speech 
delivered from signal characteristic correcting circuit 3070, and encodes the 
second decoded speech by a second encoding method to generate an LP 

25 coefficient code, an ACB code, an FCB code, and a gain code. Then, these 
codes are delivered to code multiplexing circuit 3020 as a second LP 



10 



coefficient code, a second ACB code, a second FCB code, and a second 
gain code, respectively. 

Code multiplexing circuit 3020 receives the second LP coefficient 
code, second ACB code, second FCB code, and second gain code delivered 
5 from speech encoding circuit 3060, and multiplexes them to generate code 
string data which is delivered through output terminal 20 as second code 
string data. 

Signal characteristic correcting circuit 3070 receives the first decoded 
speech delivered from speech decoding circuit 3050, and the frame type 
10 information delivered from code separation circuit 3010, and delivers a signal, 
generated by driving a filter represented by transfer function F(z), which is 
variable in accordance with the frame type information, with the first decoded 
speech, to speech encoding circuit 3060 as a second decoded speech. 
Here, as is the case with the first embodiment, filter F(z) can be 
15 expressed by the following equations when a post filter in speech decoding 
circuit 3050 has a transfer function P(z) represented by P(z). 

When the frame type information corresponds to a speech, filter F(z) 
is expressed by Equation (4): 

F(z) = F1(z) = 1/P(z) (4) 
20 When the frame type information corresponds to non-speech, filter 

F(z) is expressed by Equation (5): 

F(z) = F1{z) = 1 (5) 
When filter F(z) is a filter which has such frequency characteristics 
that emphasize high-band components of frequency, F(z) can be expressed, 
25 for example, by the following equations. 

When the frame type information corresponds to a speech, filter F(z) 
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is expressed by Equation (6): 

F(z) = F2(z) = 1-u(1/z) (6) 
When the frame type information corresponds to non-speech, filter 
F(z) is expressed by Equation (7): 
5 F(z) = F2(z) = 1-v(1/z) < 7 > 

where u, v are coefficients which represent the degrees of emphasis on high- 
band components, and for example, u=0.2, and v=0.1 . Further, F1(z) and 
F2(z) may be combined. In this event, F(z) can be expressed by the 
following equations. 

10 When the frame type information corresponds to a speech, filter F(z) 

is expressed by Equation (8): 

F(z) = F3(z) = F1(z) F2(z) = (1-u(1/z))/P(z) (8) 
When the frame type information corresponds to non-speech, filter 
F(z) is expressed by Equation (9): 
15 F(z) = F3(z) = F1(z) F2(z) = 1-v(1/z) (9) 

In the example described above, while the frame type information is 
employed for making the filter characteristics variable in accordance with the 
characteristics of a speech signal, the size of the first code string data may 
be employed instead of the frame type information, or a characteristic 
20 amount, which can be calculated from the first decoded speech, can be used. 
The characteristic amount represents the characteristics of a speech signal, 
and includes, for example, pitch periodicity, gradient of spectrum, power, and 
the like. Filter characteristics F(z) may be varied in a manner similar to the 
foregoing example when the characteristic amount corresponds to a speech 
25 and when the characteristic amount corresponds to non-speech. 

For example, when the power is considered as the characteristic 
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amount, it is contemplated, as the most simple example, to correspond 
relatively large power to a speech and to correspond small power to non- 
speech. 

When power E corresponds to a speech, filter F(z) is expressed by 
Equation (10): 

F(z) = F3{z) = F1(z)F2(z) = (1-u(Vz))IP(z). E>Th (10) 

When power E corresponds to non-speech, filter F(z) is expressed by 

Equation (11): 

Fiz) = F3(z) = F1(z)F2{z)=Wz), E<Th (11) 
where Th is a certain constant. Also, coefficients u, v may take continuous 
values as functions of E. 

Each of the code conversion apparatuses described above may be 
implemented by computer control such as a digital signal processor (DSP). 
FIG. 5 schematically illustrates the configuration of the apparatus when the 
code conversion processing in each of the aforementioned embodiments is 
implemented by a computer. 

In computer 100 for executing a program read from recording medium 
600, for executing code conversion processing for converting a first code 
generated by encoding a speech by a first encoding/decoding apparatus into 
a second code which can be decoded by a second encoding/decoding 
apparatus, recording medium 600 has recorded thereon a program for 
executing (a) processing for generating a first decoded speech from first 
code string data by a decoding method of Scheme 1 ; (b) processing for 
correcting the first decoded speech to have signal characteristics suitable for 
re-encoding using a filter to generate a second decoded signal; and (c) 
processing for encoding the second decoded speech by a second encoding 
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method to generate second code string data. 

This program is read from recording medium 600 into memory 300 
through recording medium reader 500 and interface 400. The program may 
be stored in a non-volatile memory such as ROM, flash memory or the like, 
whereas the recording medium may include, other than a non-volatile 
memory, media such as CD-ROM, FD, Digital Versatile Disk (DVD), 
magnetic tape (MT), and portable hard disk drive (HDD). Further, such a 
program may have been provided in a server device such that the program is 
downloaded to a computer through a communication network. Other than a 
recording medium which has recorded thereon such a program, the scope of 
the present invention includes a program product which comprises such a 
program, a communication medium which carries such a program for wired 
or wireless transmission, and the like. 



14 



